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(54) Method and system for generating characterizing information descriptive of a selected text 
block 



(57) A method and system for characterizing blocks 
of printed material. A digital image of printed material, 
such as an address block, on an object is obtained, and 
the image is processed to abstract characterizing infor- 
mation descriptive aspects of the other printed material. 
The characterizing information is combined with other 
information, such as postal information and the com- 
bined information is then cryptographically authenticat- 
ed with a digital signature or the like. An indicium rep- 
resentative of the authenticated information is then 
printed on the object. The object's relationship to the in- 



dicium can be verified by regenerating the characteriz- 
ing information from the other printed material and com- 
paring the regenerated characterizing information with 
characterizing information recovered from the indicium. 
Thus, copies of the indicium cannot easily be used, with- 
out detection, on other objects which do not include the 
other printed material. A combination of algorithms is se- 
lected from a predetermined group of characterizing al- 
gorithms by determining an estimate for the robustness 
of each algorithm and selecting a combination having 
high overall robustness. 
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Description 

[0001] The subject invention relates to the problem of providing a robust, compact characterization of a block of 
printed text which will distinguish the selected block of text from other such blocks. More particularly, it relates to the 
problem of providing an image-based characterization of a printed address block which can be incorporated into a 
digital postal indicium. (By "robust and compact" herein is meant information which is small enough in quantity to be 
incorporated into postal indicia yet will identify a text block, and distinguish it from other text blocks, with sufficient 
reliability to deter "rubber stamp" counterfeiting; despite errors introduced by the printing and/or scanning processes.) 
[0002] Postage metering systems account for postage and other values such as parcel delivery service charges and 
tax stamps, and print indicia representative of such values as proof of payment. To protect against counterfeiting of 
indicia, modern digital postage metering systems use encryption technology. The postage value and other information 
relating to an indicium, preferably, are digitally signed or otherwise cryptographically authenticated, and the information 
and signature are incorporated into the digital postal indicium. 

[0003] Digital postal indicia using encryption technologies are extremely secure. In general, without knowledge of 
the proper encryption keys, it is essentially impossible to produce a counterfeit digital indicium, However, digital indicia 
are subject, as are all postal indicia, to "rubber-stamp" counterfeiting when a valid indicium is scanned and reproduced 
on multiple mail pieces. To prevent such "rubber-stamp" counterfeiting, it is known to incorporate information from the 
address block of the mail piece into the postal indicium. Because space on an envelope is limited, typically only a small 
portion of the information in the address block will be incorporated into the indicium. 

[0004] In Figure 1, prior art mailing system 10 includes address printer controller 12, address printer 14, postage 
meter 16, and indicia printer 20. Address printer controller 12 receives address information from a data processing 
system (not shown), generates a bitmap representative of the nominal, or "pristine", image of the address block, and 
controls address printer 12 to print address block A, representative of the address, on envelope E. Meter 16 receives 
postage information, and other information, from the data processing system. Meter 16 also receives characterizing 
information descriptive of block A from address printer controller 12. The information received can be either text-based 
or image-based. Text-based information is descriptive of the words or characters making up to the address, (e.g., ASCII 
code) while image-based information is descriptive of the actual printed image in the address block. Meter 1 6 combines 
the characterizing information with the postage value and other information, typically digitally signs the combination, 
generates a bitmap representative of an indicium including the digitally signed combination, and controls indicia printer 
20 to print indicium IN on envelope E. When the mail piece is sent to a postal service location the address block can 
be scanned again, and the information regenerated from the scanned address block compared to information recovered 
from indicium IN, without the need to communicate with the remote mailing system; thus tying indicium IN to the par- 
ticular mail piece. (Note that since indicium IN is cryptographically linked to the address on the mail piece, printer 20 
need not be a secure printer; but can be a general purpose printer which can be controlled by other devices for other 
uses.) Commonly assigned, U.S. Patent application publication number 2004/0128254 entitled System And Method 
For Mail Destination Address Information Encoding Protection And Recovery In Postal Payment in the name of Leon 
A. Pintsov discloses a system similar to that of the Figure 1 using text-based characterizations of the address block. 
[0005] While useful for its intended purpose, problems remain with the system of Figure 1 and similar systems. It 
has proven difficult to reliably recover textual information from address blocks during the validation process using 
available optical character recognition (OCR) techniques. Attempts to increase the robustness of text-based systems 
by incorporation of additional information and/or the use of error correcting codes has resulted in undesirable increases 
in indicia size and computational complexity. Use of image-based characterizing systems, such as those described in 
the above mentioned co-pending application publication number 2004/01 28254 has been proposed and is believed to 
substantially overcome some of the problems of text-based systems. However, to date, no characterizing algorithm is 
known to be sufficiently robust for all possible applications. This problem is exacerbated by the variation in robustness 
of characterizing algorithms with respect to the particular text block to be characterized. This is an object of the present 
invention - to provide a robust and flexible method and system for characterizing particular text blocks. 
[0006] The above object is achieved and the disadvantages of the prior art are overcome in accordance with the 
subject invention by a method and system for generating characterizing information for a selected block of printed 
material (such as an address printed on an envelope or other mail piece), in which said printed material is to be scanned 
from an object and compared with said characterizing information at a location distant from where said block is printed. 
The system of the subject invention is controlled in accordance with the method to: a) print said block on an object; b) 
determine estimates of robustness for each algorithm in a predetermined set of algorithms; and c) select, as a function 
of said estimates, a combination of descriptors generated by a corresponding combination of said algorithms as said 
characterizing information. 

[0007] In accordance with one aspect of the subject invention the estimates are determined by: a) filtering a pristine 
digital image of said block of printed material with a print/scan filter to create a filtered image, said print/scan filter 
simulating the expected transformation of said pristine image by printing and scanning processes; b) applying each 
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algorithm from said predetermined set of characterizing algorithms to said filtered image to generate a plurality of 
corresponding second characterizing information descriptors for said filtered digital image; and c) for each algorithm 
from said predetermined set of characterizing algorithms, comparing corresponding said first and said second descrip- 
tors to determine said estimates of robustness. 

5 [0008] In accordance with another aspect of the subject invention the estimates are determined by: a) filtering a 
pristine digital image of said block of printed material with a print/scan filter to create a filtered image, said print/scan 
filter simulating the expected transformation of said pristine image by printing and scanning processes; b) further filtering 
said filtered image with one or more defacing filters, said defacing filters simulating simulate blots, smudges, failure of 
print elements or scanner sensors, or other, similar occasional events which can not easily be incorporated into said 

10 print/scan filter to create one or more defaced images; c) applying each algorithm from said predetermined set of 
characterizing algorithms to said filtered image and to said one or more defaced images to generate a plurality of 
corresponding second characterizing information descriptors for said filtered digital image and one or more pluralities 
of defaced image descriptors corresponding to each of said one or more defaced images; and d) for each algorithm 
from said predetermined set of characterizing algorithms, comparing corresponding first characterizing information 

15 descriptors with corresponding second characterizing information descriptors and with each of said one or more cor- 
responding defaced image descriptors to determine said estimates of robustness. 

[0009] In accordance with another aspect of the subject invention the selected descriptor is one of the second de- 
scriptors. 

[0010] In accordance with yet another aspect of the subject invention an indicium including the characterizing infor- 
20 mation is verified by; a) scanning images of said indicium and said other printed material from said object; b) inputting 
a combination of first descriptors comprising the characterizing information from said indicium image; c) identifying 
characterizing algorithms used to generate said first descriptors; d) applying said identified algorithms to said image 
of said other material to generate second descriptors; e) comparing said first and second descriptors; and.f) if said first 
and second descriptors do not match, diverting said object for further inspection. 
25 [0011] Other objects and advantages of the present invention will be apparent to those skilled in the art from con- 
sideration of the detailed description set forth below and the attached drawings. s. 
[0012] The present invention is illustrated by way of example, and not by way of limitation, in the figures of the 
accompanying drawings and in which like reference numerals refer to similar elements and in which: 
[0013] Figure 1 shows a schematic block diagram of a prior art mailing system. 
30 [0014] Figure 2 shows a schematic block diagram of a mailing system and a verification system in accordance with 

the subject invention, ^ 
[0015] Figure 3 illustrates a method for abstracting characterizing information descriptive of an address block from 
an image of the address block. 

[0016] Figure 4 illustrates another method for abstracting characterizing information descriptive of an address block 
35 from an image of the address block. 

[0017] Figure 5 illustrates another method for abstracting characterizing information descriptive of an address block 
from an image of the address block. 

[001 8] Figure 6 shows a flow diagram of the operation of the mailing system of Figure 2 in determining the robustness 
of algorithms for abstracting characterizing information in accordance with one embodiment of the subject invention. 
40 [0019] Figure 7 shows a flow diagram of the operation of the mailing system of Figure 2 in selecting a robust com- 
bination of descriptors. 

[0020] Figure 8 shows a flow diagram of the operation of the mailing system of Figure 2 in determin ing the robustness 
of algorithms for abstracting characterizing information in accordance with another embodiment of the subject invention. 
[0021] Figure 9 shows a flow diagram of the operation of the mailing system of Figure 2 in determining the robustness 

^5 of algorithms for abstracting characterizing information in accordance with another embodiment of the subject invention. 
[0022] Figure 1 0 shows a flow diagram of the operation of the verification system of Figure 2 in verifying an indicium. 
[0023] In Figure 2, mailing system 22 includes address printer controller 13, address printer 14, postage meter 16, 
and indicia printer 20, which are substantially similar to the corresponding prior art elements shown in Figure 1 . System 
22 differs in including data stores 21 and 23 communicating with controller 13 and :in the manner in which controller 

50 1 3 generates characterizing information. Data store 21 stores a plurality of characterizing of characterizing algorithms, 
as will be described further below, and data store 23 stores at least a print/scan filter which, when applied to the pristine 
image generates a filtered image which approximates the transformation of the pristine image by the printing and 
scanning processes. In other embodiments, data store 26 stores one or more defacing filters which simulate blots, 
smudges, failure of print elements or scanner sensors, or other, similar occasional events which can not easily be 

55 incorporated into said print/scan filter to create one or more defaced images. Together, meter 16, printer 20, form 
secure postal indicia printing system 22. 

[0024] Figure 2 also shows verification controller 25, camera 27, data store 28, storing the characterizing of charac- 
terizing algorithms used in system 22, and diverter 29; which together comprise verification system 30 for comparing 
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address block A with Information recovered from indicium IN and diverting mail pieces which do not match, as described 
above. (System 30 is typically located at a postal facility distant from system 22.) Except as to programming of controller 
25 to carry out the comparison algorithm of the subject invention, as will be described further below, operation of 
verification system 30 is substantially conventional and need not be described further here for an understanding of the 
5 subject invention. 

[0025] Three methods for generation of image-based characterizing information which are believed to provide im- 
proved compactness and robustness in accordance with the above object of the invention, have recently been devel- 
oped by the assignee of the present application and are described below as illustrative of the type of characterizing 
algorithms which can be used with the subject invention. Numerous other algorithms will be apparent to those skilled 
10 in the art and particular choices of algorithms to be used form no part of the subject invention, except as may be recited 
in the claims below and equivalents. 

[0026] A characterizing algorithm where the characterizing information comprises measurements of the lengths of 
the individual words which make up address A, is shown in Figure 3. Address block A is parsed to identify individual 
words by first identifying line spaces Is by determining the occurrence of large amounts of horizontal white space 
15 between blocks of printed text, and then identifying word spaces ws by determining the occurrence of large amounts 
of vertical white space between blocks of printed text (as shown with respect the first line of address A). Word lengths 
/1 through /9 are then determined for address A. Preferably, word lengths are taken (measured in pixels) from the 
edges of word spaces ws (or the address edges) as shown, but can be taken in any convenient manner, such as along 
the midline of the words. 

20 [0027] It is believed that using four or fewer bits per word would not be useful in postal applications. Thus, in a 
preferred embodiment, the number of bits used can be selected to encode all words in the address, and two control 
bits will be sufficient to indicate selection of five to eight bits per word to encode the length of the word. In other 
embodiments, a fixed number of words in the address, for example the first eight, can be scanned at a fixed number 
of bits per word; eight in this case, since control bits would not be needed to specify the number of bits per word. 

25 [0028] Another algorithm, where the characterizing information comprises measurements of the number of "outliers" 
in each word (or each line) which make up address A, is shown in Figure 4. (By "outliers" herein is meant ascenders 
or descenders and portions capitals of which project beyond thresholds, which are preferably determined by the upper 
and lower bounds of lowercase letters without ascenders or descenders, such as "a", "c", "e", etc.) Address A is parsed 
to identify individual words, if necessary, by first identifying line spaces Is by determining the occurrence of large 

30 amounts of horizontal white space between blocks of printed text, and then identifying word spaces ws by determining 
the occurrence of large amounts of vertical white space between blocks of printed text (as shown with respect the first 
line of address A). Otherwise only the lines need be identified. 

[0029] Assuming six bits are allocated per word , the number of upwards (+) and downwards (-) outliers per word can 
be encoded as "xxx/yyy" where x and y are binary digits and xxx is the number of (+) outliers and yyy is the number 
55 of (-) outliers. 

[0030] Another algorithm in which the characterizing information comprises a description of the shape of the address 
block is shown in Figure 5. The shape is determined by using a conventionaTbestfit" scanning algorithm which encloses 
address block A with "best fit" closed curve 50. (It should be understood that various algorithms for generating a best 
fit curve will generate different curves. These differences do not affect the subject invention so long as the same 

40 algorithm is used to generate the curve whose description is incorporated into the indicium and to recover the curve 
from the address block when the indicium is validated.) Preferably, curve 50 is constrained. That is the manner in which 
a curve can be generated is limited so that the resulting curve is simplified and can be described with limited information. 
In Figure 5, curve 50 is formed from linked straight line segments, such as segment 51, which are limited to eight 
"directions", up (U), down (D), left (L), right I, up-right (UR), up-left (UL), down-right (DR), and down-left, (DL); viewed 

45 as being generated starting in the upper left comer of address block A and traveling clockwise around address block 
A. Preferably the curve 50 also accounts for spaces between characters, words and lines, treating these spaces as 
equivalent to printed space, so that curve 50 does not become too convoluted and require extensive descriptive infor- 
mation. It is within the skill of a person skilled in the art to provide an algorithm which will generate robust and compact 
characterizing information, as described above. 

so [0031] The characterizing information, i.e.. the description of curve 50, can be encoded in a number of ways. In the 
present example, the characterizing information consists of only the directions, without lengths, of each successive 
line segment. 

[0032] Programming of a data processor to analyze scan data to perform imaging operations such as identifying 
lines and words, measuring the dimensions of letters and words or fitting a curve to an image in accordance with 
55 predetermined constraints are well known. Such operations are substantially routine in the character and general pat- 
tern recognition arts, for example. Techniques for carrying out such operations are also taught in: Handbook of Pattern 
Recognition and Image Processing edited by T Young and K-S Fu, Academic Press, 1986 and need not be discussed 
further here for an understanding of the subject invention. 
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[0033] Bit streams such as those describe above comprise ordered sequences of values which are typically, though 
not necessarily, numeric values associated with words in the address block. (Such bit streams are hereinafter some- 
times "characterizing information descriptors" or "descriptors" and such values are hereinafter sometimes "character- 
izations".) As described above, when an indicium is validated, i.e., tied to the mail piece on which it is printed, at a 
distant postal facility the descriptor generated from the pristine image and incorporated into the indicium is compared 
with a descriptor recovered from an image scanned from the address block printed on the mail piece. It will be apparent 
to those skilled in the art that the recovered image will be transformed with respect to the pristine image by the char- 
acteristics of the printing and scanning processes, as well as possibly by the occurrence of occasional events such as 
blots. Thus, it is important that the algorithm used to characterize the address block be robust; that is that it produces 
descriptors that match sufficiently when an indicium is valid, and do not match for invalid indicia, despite small differ- 
ences between the scanned image and the pristine image. It will also be apparent that the robustness of a particular 
characterizing algorithm can vary for different address blocks. (As a hypothetical example, the above described algo- 
rithm based on word length may be less robust for address blocks printed in a small font while algorithms based on 
the number of outliers, or address block shape may be relatively insensitive to font size.) 

[0034] Figure 6 shows a flow diagram of the operation of controller 13 in accordance with one embodiment of the 
subject invention. At step 60, controller 13 obtains a pristine digital image, P, of address block A from a conventional 
source (not shown) such as a data processing system for preparing a bulk mailing. At step 62, controller 13 carries 
out printing of address block A in a conventional manner. Preferably, this printing process is carried out concurrently 
with the selection of a characterizing algorithm but, in other embodiments of the subject invention, printing of address 
block A can be carried out sequentially or by a separate processor. 

[0035] At step 64, controller 13 inputs a print/scan filter which simulates the printing process of printer 14 and the 
scanning process to be carried out at a remote postal facility from data store 26 and applies it to image P to generate 
a filtered image, F, which approximates the image which will be scanned from the mail piece at the postal.facility. And 
at step 66 sets index i equal to 1 and variable R equal to 0. 

[0036] At step 66 controller 13 sets index i = 1 and variable R = 0, and at step 70 applies the ith characterizing 
algorithm Cj to images P and F to generate corresponding descriptors C { (P) and Cj(F); each comprising a sequence 
of M characterizations, or values, C { (P)^ through C,(P) M ; C^F^ through C,(F) M . Then at step 72, controller 1 3 compares 
descriptors Cj(P) and C,(F) to estimate a robustness value R, for the ith algorithm C |( with respect to a particular image P. 
[0037] The comparison at step 72 is carried out using a comparison algorithm associated with characterizing algo- 
rithm C, and which preferably is the same comparison algorithm used at the postal facility to compare the descriptor 
recovered from the scanned image with the descriptor incorporated into indicium IN. Preferably, the comparison is 
carried out on a characterization by characterization basis, comparing each C|(P)j with the corresponding C,(F)j to 
determine if the characterizations match; i.e., if they are "close enough" as defined by the particular comparison algo- 
rithm used. (As a hypothetical example, where the characterizations are word lengths they may be considered to 
"match" if the lengths differ by no more than one or two units; while if the characterizations are the number of outliers 
in a word a "match" may require exact equality.) 

[0038] In a preferred embodiment, once descriptors C,(P) and C,(F) have been compared an estimate R s for the 
robustness of algorithm C,, with respect to particular image P, is calculated as: 

R, = Total no. of [C,(P)j matching C^FJjj/M (for j = 1 through M); 

where M is the number of characterizations generated by C,. (Note that since robustness is defined with respect to 
small changes in the image, in normal use the filters, and the printing and scanning processes, will be such that the 
descriptors C,(P) and C,(F) will have the same number of characterizations. Otherwise an error condition is generated.) 
[0039] Once estimate R, is determined at step 74, controller 13 stores R, and Cj(X); where, in different embodiments 
of the subject invention, X can be P or F. That is, the descriptors incorporated into indicium IN can be based on either 
pristine image P or filtered image F. Then, at step 78, controller 13 sets i = i+1 and, at step 80 determines if i > N, 
where N is the number of algorithms to be evaluated. If so, controller 1 3 returns to step 66 to process the next algorithm; 
and otherwise goes to Figure 7. 

[0040] In Figure 7, at step 90 controller 1 3 determines the size of all descriptors which have been generated, i.e. the 
number of bytes required to express each descriptor, and at step 92 sets index j = 1 and variable OR = 0. 
[0041] Then, at step 94 controller 1 3 tests the jth combination, COMBj = C a (X), C b (X), ...C y (X) against predetermined 
rules. In a preferred embodiment, this is carried out by a table look-up which determines whether or not COMBj is 
permitted. Such table can be up dated off-line in response to accumulated experience or heuristic experimentation. 
As a hypothetical example, the rules might require that a particular descriptor be included in permitted combinations 
while prohibiting other particular sub-combinations of descriptors. In other embodiments, each combination is logically 
tested against the rules to determine if the combination is permitted. 
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[0042] Then at step 96, if it is determined that COMBj is permitted, then at step 100 controller 13 determines If the 
size of COMBj Is small enough to fit in the available space in indicium IN. If so, then at step 102 controller 13 calculates 
ORj, the overall robustness of COMBj. Preferably: 

ORj = avg(R ai R b ,... R y ) 

Then, at step 106 controller 13 determines if ORj > OR, and if so at step 108, sets OR = ORj. 
[0043] Then, or if the results at steps 96, 100 or 106 are negative, at step 110 controller 1 3 sets j = j+1, and at step 
10 112 determines if j > 2 N -1, that is if all combinations have been processed. If not, controller 13 returns to step 94 to 
process the next combination , and otherwise at step 1 1 6 sends J and COMBj to meter 1 6 for incorporation into indicium 
IN. The postal facility can then recover J to identify COMBj and use COMBj to validate indicium IN as will be described 
below. 

[0044] Figure 8 shows a flow diagram of the operation of controller 13 in accordance with another embodiment of 
15 the subject invention. Similar to the above described embodiment, at step 90, controller 13 obtains pristine digital 
image, P, of address block A, at step 94 carries out printing of address block A concurrently with the selection of a 
characterizing algorithm and, at step 96 inputs a print/scan filter. 

[0045] At step 100 controller 13 inputs defacing filters D 1 through D T (described above) and applies each of these 
filters to filtered image F to generate defaced images F*D 1 through F*Dt which approximate scanned images of address 
20 blocks which have been defaced by occasional events such as blots. At step 102 controller 13 sets index i equal to 1 
and variable R equal to 0. 

[0046] At step 104 controller 13 applies the ith characterizing algorithm C, to images P, F and F*D 1 through F*Qy to 
generate corresponding descriptors C,(P), Cj(F) and C^F'D-,) through C,(F*D T ); each comprising a sequence of M 
characterizations, or values, C j (P) 1 through Cj(P) M ; C,(F) 1 through C,(F) M , etc. Then at step 108, controller 13 compares 
25 descriptors Cj(P) with descriptors C,(F) and Cj(F*D n ) through Cj(F*D T ) to estimate a robustness value R, for the ith 
algorithm C )( with respect to a particular image P. 

[0047] In a preferred embodiment, once descriptors C,(P) and Cj(F) have been compared an estimate R, for the 
robustness of algorithm C,, with respect to particular image P, is calculated as: 

30 

Rj = Total no. of: [C,(P)j matching C,(F)j (for j= 1 through M) + 
C,(P)j matching C j (F*D k ) j /M (for j = 1 through M, k =1 through T)]/M(T+1); 

35 where M is the number of characterizations generated by C,. 

[0048] Again similar to the embodiment described above, once estimate R ( is determined at step 110 controller 13 
stores C,(X) (where again X can be either P or F depending upon the embodiment) and R,, At step 112, controller 13 
sets i = i+1 , and at step 116 determines if i+1 is greater than N, the number of characterizing algorithms stored. If not, 
controller 13 returns to step 104 to test the next algorithm. Otherwise, at step 120, controller 13 goes to Figure 7 and 

40 continues as described above. 

[0049] Figure 9 shows a flow diagram of the operation of controller 13 in accordance with another embodiment of 
the subject invention in which estimates of the robustness of algorithms C 5 have been previously obtained and stored. 
Such estimates can be predetermined on the basis of experience with use or heuristic experimentation, or in any other 
convenient manner. Again, at step 120 controller 13 obtains pristine digital image, P. of address block A, at step 122 

45 carries out printing of address block A concurrently with the selection of a characterizing algorithm and, at step 124 
sets index i = 1 . 

[0050] In a preferred embodiment, at step 126, controller 13 applies algorithm Cj to image P to generate descriptor 
Cj(P). In another embodiment, additional step 125 is carried out immediately after step 124 to generate filtered image 
F, and step 126a is substituted for step 126 to generate descriptor C,(F). At step 130, controller 1 3 stores Cj(X) (where 
50 again X can be either P or F depending upon the embodiment). At step 132 controller 13 sets i=i+1 and at step 136 
determines if i+1 is greater than N, the number of characterizing algorithms stored. If not, controller 1 3 returns to step 
126 (or 126a depending upon the embodiment) to test the next algorithm. Otherwise, at step 136, controller 13 goes 
to Figure 7 and continues as described above. 

[0051] It is anticipated that other estimates for robustness of characterizing algorithms will be developed as experi- 
55 ence with different applications is gained or will be apparent to those skilled in the art. Accordingly, it should be under- 
stood that, except for particular recitations in the claims below and equivalents thereof, details of particular estimates 
used form no part of the subject invention. 

[0052] Figure 10 shows a flow diagram of the operation of verification system 30 in verifying indicium IN. After en- 
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velope E is scanned by camera 27, at step 140 verification controller 125 inputs a digital scanned image SI of address 
block A, and at step 142 inputs COMBj = C a (X), C b (X), ...C y (X) and index value J. At step 144 controller 25 identifies 
algorithms C a , C b , ...C y from index value J, at step 146 calculates descriptors C a (SI), C b (SI), ...C y (SI), and at step 150 
compares corresponding descriptors. At step 1 52, if the descriptors do not match controller 25 activates diverter 29 at 
step 154 to divert envelope E for inspection; and otherwise, at step 158 sends envelope E on for normal processing. 
[0053] At step 150, in a preferred embodiment, the descriptors are determined not to match if any pair of character- 
izations do not match. That is if for any i,k the characterizations C,(X) k , C|(SI) k do not match then at step 152 no match 
is found. Preferably comparisons are made using comparison algorithms associated with each of characterizing algo- 
rithms Ci, and stores in data store 28. In other embodiments, a predetermined threshold number of characterizations 
which fail to match is required before no overall match is found. In still other embodiments, this threshold may vary 
between 1 and another predetermined value or values associated with particular combinations j and stored in data 
store 28. In other embodiments threshold values are specified in indicium IN. 

[0054] The embodiments described above and illustrated in the attached drawings have been given by way of ex- 
ample and illustration only. From the teachings of the present application those skilled in the art will readily recognize 
numerous other embodiments in accordance with the present invention. Accordingly, limitations on the present inven- 
tion are to be found only in the claims set forth below. 



Claims 

20 

1. A method for generating characterizing information for a selected block of printed material, where said printed 
material is to be scanned from an object and compared with said characterizing information at a location distant 
from where said block is printed, said method comprising the steps of: 



a) printing said block on an object; 

b) determining estimates of robustness for each algorithm in a predetermined set of algorithms; and 

c) selecting, as a function of said estimates, a combination of descriptors generated by a corresponding com- 
bination of said algorithms as said characterizing information. 



30 2. A method as described in claim 1 where said step b) comprises the sub-steps of: 



b1) filtering a pristine digital image of said block of printed material with a print/scan filter to create a filtered 
image, said print/scan filter simulating the expected transformation of said pristine image by printing and scan- 
ning processes; 

35 b2) applying each algorithm from said predetermined set of characterizing algorithms to said filtered image to 

generate a plurality of corresponding second characterizing information descriptors for said filtered digital 
image; and 

b3) for each algorithm from said predetermined set of characterizing algorithms, comparing corresponding 
said first and said second descriptors to determine said estimates of robustness. 

40 

3. A method as described in claim 1 where said step b) comprises the sub-steps of: 

b1) filtering a pristine digital image of said block of printed material with a print/scan filter to create a filtered 
image, said print/scan filter simulating the expected transformation of said pristine image by printing and scan- 

45 ning processes; 

b2) further filtering said filtered image with one or more defacing filters, said defacing filters simulating simulate 
blots, smudges, failure of print elements or scanner sensors, or other, similar occasional events which can not 
easily be incorporated into said print/scan filter to create one or more defaced images; 
b3) applying each algorithm from said predetermined set of characterizing algorithms to said filtered image 

50 and to said one or more defaced images to generate a plurality of corresponding second characterizing infor- 

mation descriptors for said filtered digital image and one or more pluralities of defaced image descriptors 
corresponding to each of said one or more defaced images; and 

b4) for each algorithm from said predetermined set of characterizing algorithms, comparing corresponding 
first characterizing information descriptors with corresponding second characterizing information descriptors 
55 and with each of said one or more corresponding defaced image descriptors to determine said estimates of 

robustness. 



4. A method as described in claim 2 or claim 3 where said selected combination of descriptors comprises said second 
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descriptors. 

5. A method as described in claim 1 where said estimates are predetermined and stored for said algorithms. 

5 6. A method as described in claim 1 where said object is a mail piece and said block of printed material represents 
an address. 

7. A method as described in claim 6 where said selected combination of descriptors is comprised in an indicium 
printed on said mail piece; whereby said selected combination can be recovered from said indicium for use at said 

10 remote location. 

8. A method as described in claim 7 where said indicium further comprises information identifying said combination. 

9. A method as described in claim 1 where selection of said combination is further based upon said descriptors' sizes. 

15 

10. A method as described in claim 1 where selection of said combination is further based upon predetermined rules. 
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